Smart Paper Notes

Transformers in Remote Sensing: A Survey

Abdulaziz Amer Aleissaee , Amandeep Kumar , Rao Muhammad Anwer , Salman Khan , Hisham Cholakkal , Gui-Song Xia , Fahad Shahbaz Khan

Category: Computer Vision

2022-09-02

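Over the past decade, deep learning-based algorithms have become widely popular in different areas of remote sensing image analysis. Recently, transformer-based architectures, originally introduced in natural language processing, have pervaded the computer vision field, where the self-attention mechanism has been used as a replacement for the popular convolution operator to capture long-range dependencies. Inspired by recent advances in computer vision, the remote sensing community has also witnessed growing exploration of vision transformers for a diverse set of tasks. Although many surveys have focused on transformers in computer vision in general, to the best of our knowledge we are the first to present a systematic review of recent transformer-based advances in remote sensing. Our survey covers more than 60 recent transformer-based methods for different remote sensing problems in three sub-areas of remote sensing: very high-resolution (VHR), hyperspectral (HSI), and synthetic aperture radar (SAR) imagery. We conclude the survey by discussing different challenges and open issues of transformers in remote sensing. Additionally, we intend to frequently update and maintain a list of the latest transformer-in-remote-sensing papers with their respective code at: https://github.com/virobo-15/transformer-in-in-remote-sensing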

translated by Google Translate

An Adaptive Simulated Annealing-Based Machine Learning Approach for Developing an E-Triage Tool for Hospital Emergency Operations

Abdulaziz Ahmed , Mohammed Al-Maamari , Mohammad Firouz , Dursun Delen

Category: Artificial Intelligence

2022-12-22

Patient triage at emergency departments (EDs) is necessary to prioritize care for patients with critical and time-sensitive conditions. Several tools are used for patient triage; one of the most common is the emergency severity index (ESI), a five-level scale on which level 1 is the most urgent and level 5 the least urgent. This paper proposes a framework for using machine learning to develop an e-triage tool for EDs. A large retrospective dataset covering three years of ED patient visits is obtained from the electronic health records of a healthcare provider in the Midwest of the US. The main challenge of using machine learning algorithms is that most of them have many hyperparameters, and a high-performance model cannot be developed without optimizing them. This paper therefore proposes an approach to hyperparameter optimization. The metaheuristic optimization algorithms simulated annealing (SA) and adaptive simulated annealing (ASA) are used to optimize the parameters of extreme gradient boosting (XGB) and categorical boosting (CaB), yielding four newly proposed algorithms: SA-XGB, ASA-XGB, SA-CaB, and ASA-CaB. Grid search (GS), a traditional approach to machine learning fine-tuning, is also used to tune the parameters of XGB and CaB; the resulting models are named GS-XGB and GS-CaB. The six algorithms are trained and tested using eight data groups obtained from the feature selection phase. The results show that ASA-CaB outperformed all the other algorithms, with accuracy, precision, recall, and F1 of 83.3%, 83.2%, 83.3%, and 83.2%, respectively.
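
The idea of tuning hyperparameters with simulated annealing can be illustrated with a minimal sketch. This is plain SA over a small discrete grid, not the paper's adaptive variant (ASA), whose details the abstract does not give. The function `toy_validation_error`, the search space, the cooling schedule, and all default values are assumptions made for illustration; in practice the objective would train a boosted model and return its validation error.

```python
import math
import random


def simulated_annealing(objective, space, n_iter=200, t0=1.0, cooling=0.95, seed=0):
    """Minimize `objective` over a discrete hyperparameter grid.

    `space` maps each hyperparameter name to a list of candidate values.
    """
    rng = random.Random(seed)
    names = sorted(space)
    current = {name: rng.choice(space[name]) for name in names}
    current_loss = objective(current)
    best, best_loss = dict(current), current_loss
    temperature = t0
    for _ in range(n_iter):
        # Neighbor: resample one randomly chosen hyperparameter.
        candidate = dict(current)
        name = rng.choice(names)
        candidate[name] = rng.choice(space[name])
        candidate_loss = objective(candidate)
        delta = candidate_loss - current_loss
        # Always accept improvements; accept worse moves with
        # probability exp(-delta / temperature).
        if delta <= 0 or rng.random() < math.exp(-delta / temperature):
            current, current_loss = candidate, candidate_loss
            if current_loss < best_loss:
                best, best_loss = dict(current), current_loss
        temperature *= cooling  # geometric cooling schedule (an assumption)
    return best, best_loss


def toy_validation_error(params):
    # Hypothetical stand-in for "1 - validation accuracy" of a boosted model.
    return (params["max_depth"] - 6) ** 2 / 100 + abs(params["learning_rate"] - 0.1)


search_space = {
    "max_depth": [2, 4, 6, 8, 10],
    "learning_rate": [0.01, 0.05, 0.1, 0.3],
}
best_params, best_error = simulated_annealing(toy_validation_error, search_space)
print(best_params, best_error)
```

Unlike grid search, which evaluates every combination, this loop evaluates one neighbor per iteration and occasionally accepts a worse configuration early on, which is what lets it escape poor regions of the search space.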

A Study of Left Before Treatment Complete Emergency Department Patients: An Optimized Explanatory Machine Learning Framework

Abdulaziz Ahmed , Khalid Y. Aram , Salih Tutun

Category: Artificial Intelligence

2022-12-22

Patients who leave before treatment is complete (LBTC) are common in emergency departments (EDs). This issue represents a medico-legal risk and may cause a revenue loss. Understanding the factors that cause patients to leave before treatment is complete is therefore vital to mitigating and potentially eliminating these adverse effects. This paper proposes a framework for studying the factors that affect LBTC outcomes in EDs. The framework integrates machine learning, metaheuristic optimization, and model interpretation techniques. Metaheuristic optimization is used for hyperparameter optimization, one of the main challenges of machine learning model development. Three metaheuristic optimization algorithms are employed to optimize the parameters of extreme gradient boosting (XGB): simulated annealing (SA), adaptive simulated annealing (ASA), and adaptive tabu simulated annealing (ATSA). The optimized XGB models are used to predict the LBTC outcomes for patients under treatment in the ED. The designed algorithms are trained and tested using four data groups resulting from the feature selection phase. The model with the best predictive performance is interpreted using the SHapley Additive exPlanations (SHAP) method. The findings show that ATSA-XGB outperformed the other model configurations, with an accuracy, area under the curve (AUC), sensitivity, specificity, and F1-score of 86.61%, 87.50%, 85.71%, 87.51%, and 86.60%, respectively. The degree and direction of the effect of each feature were determined and explained using the SHAP method.

Site Assessment and Layout Optimization for Rooftop Solar Energy Generation in Worldview-3 Imagery

Zeyad Awwad , Abdulaziz Alharbi , Abdulelah H. Habib , Olivier L. de Weck

Category: Computer Vision

2022-12-07

With the growth of residential rooftop PV adoption in recent decades, the problem of effective layout design has become increasingly important in recent years. Although a number of automated methods have been introduced, these tend to rely on simplifying assumptions and heuristics to improve computational tractability. We demonstrate a fully automated layout design pipeline that attempts to solve a more general formulation with greater geometric flexibility that accounts for shading losses. Our approach generates rooftop areas from satellite imagery and uses MINLP optimization to select panel positions, azimuth angles and tilt angles on an individual basis rather than imposing any predefined layouts. Our results demonstrate that although several common heuristics are often effective, they may not be universally suitable due to complications resulting from geometric restrictions and shading losses. Finally, we evaluate a few specific heuristics from the literature and propose a potential new rule of thumb that may help improve rooftop solar energy potential when shading effects are considered.

Single image calibration using knowledge distillation approaches

Khadidja Ould Amer , Oussama Hadjerci , Mohamed Abbas Hedjazi , Antoine Letienne

Category: Computer Vision

2022-12-05

Although recent deep learning-based calibration methods can predict extrinsic and intrinsic camera parameters from a single image, their generalization remains limited by the number and distribution of training data samples. Their huge computational and space requirements prevent convolutional neural networks (CNNs) from being deployed in resource-constrained environments. This challenge motivated us to train a CNN incrementally, learning from new data while maintaining performance on previously learned data. Our approach builds upon a CNN architecture to automatically estimate camera parameters (focal length, pitch, and roll), using different incremental learning strategies to preserve knowledge when updating the network for new data distributions. Specifically, we adapt four common incremental learning methods, namely LwF, iCaRL, LUCIR, and BiC, by modifying their loss functions to suit our regression problem. We evaluate on two datasets containing 299,008 indoor and outdoor images. The experimental results were significant and indicated which method was better suited to camera calibration estimation.

Robust Transferable Feature Extractors: Learning to Defend Pre-Trained Networks Against White Box Adversaries

Alexander Cann , Ian Colbert , Ihab Amer

Category: Machine Learning | Artificial Intelligence | Machine Learning (Statistics)

2022-09-14

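The widespread adoption of deep neural networks in computer vision applications has generated significant interest in adversarial robustness. Existing research has shown that maliciously perturbed inputs tailored to a given model (i.e., adversarial examples) can be successfully transferred to another, independently trained model to induce prediction errors. Moreover, this property of adversarial examples has been attributed to features derived from predictive patterns in the data distribution. We are therefore motivated to investigate the following question: can adversarial defenses, like adversarial examples, be successfully transferred to other independently trained models? To this end, we propose a deep learning-based pre-processing mechanism, which we refer to as a robust transferable feature extractor (RTFE). After examining the theoretical motivation and implications, we show experimentally that our method can provide adversarial robustness to multiple independently trained classifiers that are otherwise ineffective against an adaptive white-box adversary. Furthermore, we show that RTFEs can even provide one-shot adversarial robustness to models independently trained on different datasets.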

Fast Vocabulary Projection Method via Clustering for Multilingual Machine Translation on GPU

Hossam Amer , Young Jin Kim , Mohamed Afify , Hitokazu Matsushita , Hany Hassan Awadallah

Category: Natural Language Processing | Machine Learning

2022-08-14

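Multilingual neural machine translation has shown great success using transformer models. Deploying these models is challenging because they usually require large vocabulary (vocab) sizes to cover various languages. This limits the speed of predicting output tokens in the final vocab projection layer. To alleviate these challenges, this paper proposes a fast vocabulary projection method via clustering that can be used for multilingual transformers on GPUs. First, we split the vocab search space offline into disjoint clusters given the hidden context vector of the decoder output, which results in far fewer vocab columns for the vocab projection. Second, at inference time, the proposed method predicts the clusters and the candidate active tokens for the hidden context vectors at the vocab projection. The paper also analyzes different ways of building these clusters in multilingual settings. Our results show end-to-end speed gains of up to 25% in float16 GPU inference while maintaining the BLEU score and slightly increasing memory cost. The proposed method speeds up the vocab projection step itself by up to 2.6x. We also conduct an extensive human evaluation to verify that the proposed method preserves the translation quality of the original model.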
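
The two-step inference described above (predict a cluster, then project only onto that cluster's candidate tokens) can be sketched in a few lines of plain Python. This is an illustrative toy under stated assumptions, not the paper's implementation: the cluster is chosen by the largest dot product with a centroid, and the names `centroids`, `cluster_tokens`, and `vocab_weights`, along with all numbers, are hypothetical.

```python
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def full_projection(hidden, vocab_weights):
    """Baseline: score every token in the vocabulary and return the argmax."""
    scores = [dot(hidden, w) for w in vocab_weights]
    return max(range(len(scores)), key=scores.__getitem__)


def clustered_projection(hidden, centroids, cluster_tokens, vocab_weights):
    """Score only the candidate tokens of the cluster predicted for `hidden`."""
    # Step 1: predict the cluster (assumed here: largest dot product with a centroid).
    cluster = max(range(len(centroids)), key=lambda k: dot(hidden, centroids[k]))
    # Step 2: project onto the much smaller set of candidate vocab columns.
    candidates = cluster_tokens[cluster]
    return max(candidates, key=lambda t: dot(hidden, vocab_weights[t]))


# Toy setup (all values hypothetical): 6 tokens, 2-dimensional hidden states.
vocab_weights = [
    [1.0, 0.0], [0.9, 0.1], [0.8, -0.2],   # tokens 0-2
    [0.0, 1.0], [0.1, 0.9], [-0.2, 0.8],   # tokens 3-5
]
centroids = [[1.0, 0.0], [0.0, 1.0]]       # built offline in a real system
cluster_tokens = [[0, 1, 2], [3, 4, 5]]    # candidate tokens per cluster

hidden = [0.7, 0.3]
token_fast = clustered_projection(hidden, centroids, cluster_tokens, vocab_weights)
token_full = full_projection(hidden, vocab_weights)
print(token_fast, token_full)  # both select token 0
```

In this toy case the clustered path computes 2 centroid scores plus 3 token scores instead of 6 token scores; the saving only becomes meaningful when the vocabulary is far larger than the number of clusters and candidates.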

ABCinML: Anticipatory Bias Correction in Machine Learning Applications

Abdulaziz A. Almuzaini , Chidansh A. Bhatt , David M. Pennock , Vivek K. Singh

Category: Machine Learning

2022-06-14

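The idealization of a static machine learning model, trained once and deployed forever, is not practical. As input distributions change over time, the model will not only lose accuracy; any constraints meant to reduce bias against a protected class may also fail to work as intended. Researchers have therefore begun to explore ways to maintain algorithmic fairness over time. One line of work focuses on dynamic learning, retraining after each batch, while another focuses on robust learning, which tries to make algorithms robust against all possible future changes. Dynamic learning seeks to reduce biases soon after they have occurred, and robust learning often yields (overly) conservative models. We propose an anticipatory dynamic learning approach that corrects the algorithm to mitigate bias before it occurs. Specifically, we use anticipations about the relative distributions of population subgroups (e.g., the relative ratios of male and female applicants) in the next cycle to identify the right parameters for an importance weighting fairness approach. Results from experiments on multiple real-world datasets suggest that this approach holds promise for anticipatory bias correction.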
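
One simple way to picture anticipatory importance weighting is to reweight the current batch so that its subgroup shares match those expected in the next cycle. The sketch below is an assumption-laden illustration, not the paper's method: the abstract does not specify how the weights are parameterized, and the function name `anticipatory_weights`, the group labels, and the forecast shares are all hypothetical.

```python
from collections import Counter


def anticipatory_weights(groups, anticipated_share):
    """Per-sample weights that shift the current batch toward the
    subgroup shares anticipated for the next cycle.

    weight(g) = anticipated share of g / observed share of g
    """
    counts = Counter(groups)
    n = len(groups)
    observed_share = {g: c / n for g, c in counts.items()}
    return [anticipated_share[g] / observed_share[g] for g in groups]


# Current batch: 75% group "A", 25% group "B" (hypothetical data).
groups = ["A", "A", "A", "B"]
# Anticipated next cycle: equal shares (hypothetical forecast).
weights = anticipatory_weights(groups, {"A": 0.5, "B": 0.5})
print(weights)  # group "A" samples are down-weighted, the "B" sample gets weight 2.0
```

The resulting weights could then be passed as per-sample weights when retraining a classifier, so the model is fitted to the distribution it is expected to face rather than the one it has just seen.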

Efficiency Comparison of AI classification algorithms for Image Detection and Recognition in Real-time

Musarrat Saberin Nipun , Rejwan Bin Sulaiman , Amer Kareem

Category: Computer Vision | Artificial Intelligence

2022-06-12

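Face detection and recognition is among the most difficult and most frequently used tasks in artificial intelligence systems. The goal of this study is to present and compare the results of several face detection and recognition algorithms used in the system. The system begins with a training image of a person, then moves on to the test image, identifies the face, compares it with the trained face, and finally classifies it using OpenCV classifiers. This study discusses the most effective and successful strategies used in the system, which are implemented using Python, OpenCV, and Matplotlib. The system may also be used in locations with CCTV, such as public spaces, shopping malls, and ATM booths.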

Combinations of Jaccard with Numerical Measures for Collaborative Filtering Enhancement: Current Work and Future Proposal

Ali A. Amer , Loc Nguyen

Category: Artificial Intelligence

2021-11-24

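Collaborative filtering (CF) is an important approach for recommender systems and is widely used in many aspects of our lives, particularly in online commercial systems. One popular algorithm in CF is the K-nearest neighbors (KNN) algorithm, in which similarity measures are used to determine a user's nearest neighbors and thereby quantify the degree of dependency between the relevant user/item pairs. Consequently, the CF approach is not merely sensitive to the similarity measure; it depends entirely on the choice of that measure. While Jaccard, one of the commonly used similarity measures for CF tasks, concerns the existence of ratings, other numerical measures such as cosine and Pearson concern the magnitude of ratings. Jaccard on its own is not a dominant measure, but it has long been shown to be an important factor in improving other measures. Therefore, as part of our continuing effort to find the most effective similarity measures for CF, this research proposes new similarity measures that combine Jaccard with several numerical measures. The combined measures take advantage of both existence and magnitude. Experimental results on the MovieLens dataset show that the combined measures are superior, outperforming all single measures on the evaluation metrics considered.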
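
To make the existence-versus-magnitude distinction concrete, here is a minimal sketch that combines Jaccard with cosine similarity for two users. Multiplying the two measures is an assumption chosen for illustration; the abstract does not state how the measures are combined. The rating dictionaries and the names `alice` and `bob` are hypothetical.

```python
import math


def jaccard(ratings_u, ratings_v):
    """Existence of ratings: overlap of the two users' rated item sets."""
    items_u, items_v = set(ratings_u), set(ratings_v)
    union = items_u | items_v
    return len(items_u & items_v) / len(union) if union else 0.0


def cosine(ratings_u, ratings_v):
    """Magnitude of ratings: cosine similarity over co-rated items."""
    common = set(ratings_u) & set(ratings_v)
    if not common:
        return 0.0
    num = sum(ratings_u[i] * ratings_v[i] for i in common)
    norm_u = math.sqrt(sum(ratings_u[i] ** 2 for i in common))
    norm_v = math.sqrt(sum(ratings_v[i] ** 2 for i in common))
    return num / (norm_u * norm_v) if norm_u and norm_v else 0.0


def jaccard_cosine(ratings_u, ratings_v):
    """Combined measure (assumed here to be a simple product)."""
    return jaccard(ratings_u, ratings_v) * cosine(ratings_u, ratings_v)


# Hypothetical users: item id -> rating.
alice = {"m1": 5, "m2": 3, "m3": 4}
bob = {"m1": 5, "m2": 3, "m4": 2}

print(jaccard(alice, bob))         # 0.5: two shared items out of four rated overall
print(cosine(alice, bob))          # close to 1.0: identical ratings on shared items
print(jaccard_cosine(alice, bob))  # close to 0.5
```

Cosine alone would rate these two users as near-identical even though they share only half of their rated items; the Jaccard factor discounts the score by how little their rating histories overlap.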
